Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Lab Invest ; 104(3): 100304, 2024 03.
Artigo em Inglês | MEDLINE | ID: mdl-38092179

RESUMO

Gene expression profiling from formalin-fixed paraffin-embedded (FFPE) renal allograft biopsies is a promising approach for feasibly providing a molecular diagnosis of rejection. However, large-scale studies evaluating the performance of models using NanoString platform data to define molecular archetypes of rejection are lacking. We tested a diverse retrospective cohort of over 1400 FFPE biopsy specimens, rescored according to Banff 2019 criteria and representing 10 of 11 United Network of Organ Sharing regions, using the Banff Human Organ Transplant panel from NanoString and developed a multiclass model from the gene expression data to assign relative probabilities of 4 molecular archetypes: No Rejection, Antibody-Mediated Rejection, T Cell-Mediated Rejection, and Mixed Rejection. Using Least Absolute Shrinkage and Selection Operator regularized regression with 10-fold cross-validation fitted to 1050 biopsies in the discovery cohort and technically validated on an additional 345 biopsies, our model achieved overall accuracy of 85% in the discovery cohort and 80% in the validation cohort, with ≥75% positive predictive value for each class, except for the Mixed Rejection class in the validation cohort (positive predictive value, 53%). This study represents the technical validation of the first model built from a large and diverse sample of diagnostic FFPE biopsy specimens to define and classify molecular archetypes of histologically defined diagnoses as derived from Banff Human Organ Transplant panel gene expression profiling data.


Assuntos
Nefropatias , Transplante de Rim , Transplante de Órgãos , Humanos , Transplante de Rim/efeitos adversos , Estudos de Coortes , Estudos Retrospectivos , Rejeição de Enxerto/diagnóstico , Rejeição de Enxerto/genética , Nefropatias/patologia , Expressão Gênica , Biópsia , Rim/patologia
2.
Mol Ecol Resour ; 21(8): 2598-2613, 2021 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-33950563

RESUMO

Simulation-based methods such as approximate Bayesian computation (ABC) are well-adapted to the analysis of complex scenarios of populations and species genetic history. In this context, supervised machine learning (SML) methods provide attractive statistical solutions to conduct efficient inferences about scenario choice and parameter estimation. The Random Forest methodology (RF) is a powerful ensemble of SML algorithms used for classification or regression problems. Random Forest allows conducting inferences at a low computational cost, without preliminary selection of the relevant components of the ABC summary statistics, and bypassing the derivation of ABC tolerance levels. We have implemented a set of RF algorithms to process inferences using simulated data sets generated from an extended version of the population genetic simulator implemented in DIYABC v2.1.0. The resulting computer package, named DIYABC Random Forest v1.0, integrates two functionalities into a user-friendly interface: the simulation under custom evolutionary scenarios of different types of molecular data (microsatellites, DNA sequences or SNPs) and RF treatments including statistical tools to evaluate the power and accuracy of inferences. We illustrate the functionalities of DIYABC Random Forest v1.0 for both scenario choice and parameter estimation through the analysis of pseudo-observed and real data sets corresponding to pool-sequencing and individual-sequencing SNP data sets. Because of the properties inherent to the implemented RF methods and the large feature vector (including various summary statistics and their linear combinations) available for SNP data, DIYABC Random Forest v1.0 can efficiently contribute to the analysis of large SNP data sets to make inferences about complex population genetic histories.


Assuntos
Algoritmos , Genética Populacional , Teorema de Bayes , Simulação por Computador , Demografia , Polimorfismo de Nucleotídeo Único , Aprendizado de Máquina Supervisionado
3.
J Clin Med ; 10(4)2021 Feb 19.
Artigo em Inglês | MEDLINE | ID: mdl-33669799

RESUMO

The aim of the study was to verify the association of clinical relapses and brain activity with disability progression in relapsing/remitting multiple sclerosis patients receiving disease-modifying treatments in Poland. Disability progression was defined as relapse-associated worsening (RAW), progression independent of relapse activity (PIRA), and progression independent of relapses and brain MRI Activity (PIRMA). Data from the Therapeutic Program Monitoring System were analyzed. Three panels of patients were identified: R0, no relapse during treatment, and R1 and R2 with the occurrence of relapse during the first and the second year of treatment, respectively. In the R0 panel, we detected 4.6% PIRA patients at 24 months (p < 0.001, 5.0% at 36 months, 5.6% at 48 months, 6.1% at 60 months). When restricting this panel to patients without brain MRI activity, we detected 3.0% PIRMA patients at 12 months, 4.5% at 24 months, and varying from 5.3% to 6.2% between 36 and 60 months of treatment, respectively. In the R1 panel, RAW was detected in 15.6% patients at 12 months and, in the absence of further relapses, 9.7% at 24 months and 6.8% at 36 months of treatment. The R2 group was associated with RAW significantly more frequently at 24 months compared to the R1 at 12 months (20.7%; p < 0.05), but without a statistical difference later on. In our work, we confirmed that disability progression was independent of relapses and brain MRI activity.

4.
Nat Commun ; 11(1): 6161, 2020 12 02.
Artigo em Inglês | MEDLINE | ID: mdl-33268789

RESUMO

DNA 5-hydroxymethylcytosine (5hmC) modification is known to be associated with gene transcription and frequently used as a mark to investigate dynamic DNA methylation conversion during mammalian development and in human diseases. However, the lack of genome-wide 5hmC profiles in different human tissue types impedes drawing generalized conclusions about how 5hmC is implicated in transcription activity and tissue specificity. To meet this need, we describe the development of a 5hmC tissue map by characterizing the genomic distributions of 5hmC in 19 human tissues derived from ten organ systems. Subsequent sequencing results enabled the identification of genome-wide 5hmC distributions that uniquely separates samples by tissue type. Further comparison of the 5hmC profiles with transcriptomes and histone modifications revealed that 5hmC is preferentially enriched on tissue-specific gene bodies and enhancers. Taken together, the results provide an extensive 5hmC map across diverse human tissue types that suggests a potential role of 5hmC in tissue-specific development; as well as a resource to facilitate future studies of DNA demethylation in pathogenesis and the development of 5hmC as biomarkers.


Assuntos
5-Metilcitosina/análogos & derivados , Citosina/metabolismo , DNA/metabolismo , Elementos Facilitadores Genéticos , Epigênese Genética , Genoma Humano , Fatores de Transcrição/metabolismo , 5-Metilcitosina/metabolismo , Mapeamento Cromossômico , Ilhas de CpG , DNA/genética , Metilação de DNA , Histonas/genética , Histonas/metabolismo , Humanos , Especificidade de Órgãos , Fatores de Transcrição/genética , Ativação Transcricional
5.
Eur Neurol ; 83(5): 487-492, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33027785

RESUMO

INTRODUCTION: Presence of anti-JC-virus antibodies (JCVAbs) is associated with the increased risk of natalizumab (NAT)-related progressive multifocal leukoencephalopathy (PML). Little is known about seroconversion rate and time to seroconversion in relapsing-remitting multiple sclerosis (RRMS) patients treated with NAT in Poland. The aim of the study was to assess the true risk of PML, seroconversion rate, and time to seroconversion in all JCVAb-negative RRMS patients treated with NAT in Poland. METHODS: Demographic and clinical data of all Polish RRMS patients treated with NAT reimbursed by National Health Fund (NFZ) were prospectively collected in electronic files using the Therapeutic Programme Monitoring System provided by NFZ. The assessment of JCVAb presence (without collection of JCVAb index value) in serum (Unilabs, STRATIFY JCV: anti-JCV antibody ELISA) was done at the beginning of therapy and then repeated every 6 months. The maximum follow-up time was 4 years. In Poland, since 2013, according to the NFZ drug program guidance, only patients with negative JCVAb test have started treatment with NAT. RESULTS: In all Polish multiple sclerosis centers, 210 negative JCVAb RRMS patients with at least 9 (±3) months of observation (146 females, 64 males, and the median age at baseline: 33 years) were included in the study. During the follow-up period, JCVAb status changed from negative to positive in 34 patients (16.2%). For half of the patients, the seroconversion was diagnosed 1 year after starting NAT treatment. In 4 patients (1.9%) during follow-up, JCVAb status changed again from positive to negative. In Poland, before establishment of NFZ drug program, 4 cases of PML in patients treated with NAT in clinical trials were diagnosed. In the NFZ drug program, since 2013, no patient treated with NAT has been diagnosed with PML. CONCLUSIONS: NAT therapy in JCV-seronegative RRMS patients is safe and results in the absence of PML cases. In Poland, JCV seroconversion rate is similar to that observed in other European countries.


Assuntos
Fatores Imunológicos/efeitos adversos , Leucoencefalopatia Multifocal Progressiva/imunologia , Esclerose Múltipla Recidivante-Remitente/virologia , Natalizumab/efeitos adversos , Soroconversão , Adulto , Anticorpos Antivirais/sangue , Feminino , Humanos , Hospedeiro Imunocomprometido/imunologia , Vírus JC/imunologia , Leucoencefalopatia Multifocal Progressiva/epidemiologia , Masculino , Esclerose Múltipla Recidivante-Remitente/tratamento farmacológico , Polônia , Adulto Jovem
6.
Nat Commun ; 11(1): 5270, 2020 10 19.
Artigo em Inglês | MEDLINE | ID: mdl-33077732

RESUMO

Pancreatic cancer is often detected late, when curative therapies are no longer possible. Here, we present non-invasive detection of pancreatic ductal adenocarcinoma (PDAC) by 5-hydroxymethylcytosine (5hmC) changes in circulating cell free DNA from a PDAC cohort (n = 64) in comparison with a non-cancer cohort (n = 243). Differential hydroxymethylation is found in thousands of genes, most significantly in genes related to pancreas development or function (GATA4, GATA6, PROX1, ONECUT1, MEIS2), and cancer pathogenesis (YAP1, TEAD1, PROX1, IGF1). cfDNA hydroxymethylome in PDAC cohort is differentially enriched for genes that are commonly de-regulated in PDAC tumors upon activation of KRAS and inactivation of TP53. Regularized regression models built using 5hmC densities in genes perform with AUC of 0.92 (discovery dataset, n = 79) and 0.92-0.94 (two independent test sets, n = 228). Furthermore, tissue-derived 5hmC features can be used to classify PDAC cfDNA (AUC = 0.88). These findings suggest that 5hmC changes enable classification of PDAC even during early stage disease.


Assuntos
5-Metilcitosina/análogos & derivados , Ácidos Nucleicos Livres/metabolismo , Neoplasias Pancreáticas/genética , 5-Metilcitosina/metabolismo , Adulto , Biomarcadores Tumorais/sangue , Biomarcadores Tumorais/metabolismo , Ácidos Nucleicos Livres/sangue , Ácidos Nucleicos Livres/genética , Estudos de Coortes , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Feminino , Fator de Transcrição GATA4/genética , Fator de Transcrição GATA4/metabolismo , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Humanos , Masculino , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Neoplasias Pancreáticas/sangue , Neoplasias Pancreáticas/metabolismo , Neoplasias Pancreáticas/patologia , Fatores de Transcrição de Domínio TEA , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Proteínas Supressoras de Tumor/genética , Proteínas Supressoras de Tumor/metabolismo , Neoplasias Pancreáticas
7.
Neurol Neurochir Pol ; 54(2): 161-168, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32219813

RESUMO

AIM OF STUDY: The aim of this study was to collect and analyse data on relapsing-remitting multiple sclerosis (RRMS) patients receiving disease-modifying therapies (DMTs) in Poland. MATERIAL AND METHODS: This observational, multicentre study with prospective data collection included RRMS patients receiving DMTs reimbursed by the National Health Fund (NFZ) in Poland, monitored by the Therapeutic Programme Monitoring System (SMPT). Demographic profiles, disability status, and treatment modalities were analysed. RESULTS: Data from 11,632 RRMS patients was collected (from 15,368 new prescriptions), including 10,649 patients in the first-line and 983 in the second-line therapeutic programme of DMTs. The proportion of females to males was 2.39 in the first-line and 1.91 in the second-line. The mean age at DMTs start was 36.6 years in the first-line and 35.1 in the second-line. The median time from the first symptoms to MS diagnosis was 7.4 months, and from MS diagnosis to treatment it was 18.48 months. A total of 43.4% of MS patients started DMT during the 12 months following diagnosis. There was a positive correlation between the duration from MS diagnosis to the start of DMT and a higher initial EDSS value [correlation 0.296 (p < 0.001)]. About 10% of patients stopped DMTs. In Poland, about one third of all MS patients are treated in both lines, and the choice of first-line treatment depends on the region of the country. CONCLUSIONS: In Poland there is a need to increase MS patient access to DMTs by improving the organisation of drug programmes.


Assuntos
Esclerose Múltipla , Adulto , Feminino , Humanos , Masculino , Polônia , Estudos Prospectivos
8.
Int J Mol Sci ; 21(5)2020 Mar 02.
Artigo em Inglês | MEDLINE | ID: mdl-32131491

RESUMO

Recent studies have led to considerable advances in the identification of genetic variants associated with type 1 and type 2 diabetes. An approach for converting genetic data into a predictive measure of disease susceptibility is to add the risk effects of loci into a polygenic risk score. In order to summarize the recent findings, we conducted a systematic review of studies comparing the accuracy of polygenic risk scores developed during the last two decades. We selected 15 risk scores from three databases (Scopus, Web of Science and PubMed) enrolled in this systematic review. We identified three polygenic risk scores that discriminate between type 1 diabetes patients and healthy people, one that discriminate between type 1 and type 2 diabetes, two that discriminate between type 1 and monogenic diabetes and nine polygenic risk scores that discriminate between type 2 diabetes patients and healthy people. Prediction accuracy of polygenic risk scores was assessed by comparing the area under the curve. The actual benefits, potential obstacles and possible solutions for the implementation of polygenic risk scores in clinical practice were also discussed. Develop strategies to establish the clinical validity of polygenic risk scores by creating a framework for the interpretation of findings and their translation into actual evidence, are the way to demonstrate their utility in medical practice.


Assuntos
Diabetes Mellitus Tipo 1/genética , Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Humanos , Herança Multifatorial
9.
Cancers (Basel) ; 12(1)2019 Dec 21.
Artigo em Inglês | MEDLINE | ID: mdl-31877723

RESUMO

Non-small-cell lung cancer (NSCLC) represents a heterogeneous group of malignancies consisting essentially of adenocarcinoma (ADC) and squamous cell carcinoma (SCC). Although the diagnosis and treatment of ADC and SCC have been greatly improved in recent decades, there is still an urgent need to identify accurate transcriptome profile associated with the histological subtypes of NSCLC. The present study aims to identify the key dysregulated pathways and genes involved in the development of lung ADC and SCC and to relate them with the clinical traits. The transcriptional changes between tumour and normal lung tissues were investigated by RNA-seq. Gene ontology (GO), canonical pathways analysis with the prediction of upstream regulators, and weighted gene co-expression network analysis (WGCNA) to identify co-expressed modules and hub genes were used to explore the biological functions of the identified dysregulated genes. It was indicated that specific gene signatures differed significantly between ADC and SCC related to the distinct pathways. Of identified modules, four and two modules were the most related to clinical features in ADC and SCC, respectively. CTLA4, MZB1, NIP7, and BUB1B in ADC, as well as GNG11 and CCNB2 in SCC, are novel top hub genes in modules associated with tumour size, SUVmax, and recurrence-free survival. Our research provides a more effective understanding of the importance of biological pathways and the relationships between major genes in NSCLC in the perspective of searching for new molecular targets.

10.
PLoS One ; 14(10): e0223863, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31647829

RESUMO

OBJECTIVE: The aim of the study was to assess the effectiveness of disease-modifying therapies (DMTs) in relapsing-remitting multiple sclerosis (RRMS) patients treated in MS centres in Poland. METHODS: Demographic and clinical data of all Polish RRMS patients receiving DMTs were prospectively collected from 2014 to 2018 in electronic files using the Therapeutic Program Monitoring System (SMPT). RESULTS: The study included 10,764 RRMS patients treated with DMTs in first-line and 1,042 in second-line programmes. IFNß more effectively lengthened the times to the first relapse, disability progression, and brain MRI activity than GA. After 2 and 4 years of follow-up, more patients on IFNß showed no evidence of disease activity (NEDA-3) in comparison to GA (66.3% and 44.3% vs 55.2% and 33.2%, respectively; p<0.001). NAT more effectively reduced brain MRI activity than FTY (p = 0.001). More patients under NAT had NEDA-3 after 2 and 4 years of follow-up compared to FTY (66.2% and 42.1% vs 52.1% and 29.5%, respectively; p = 0.03). In adjusted analysis, a higher baseline Expanded Disability Status Score (EDSS) was a predictor of relapse (p<0.001) and NEDA-3 failure (p = 0.003). CONCLUSION: IFNß compared to GA and NAT compared to FTY more effectively reduced disease activity in a Polish population of RRMS patients.


Assuntos
Cloridrato de Fingolimode/uso terapêutico , Acetato de Glatiramer/uso terapêutico , Interferon beta/uso terapêutico , Esclerose Múltipla Recidivante-Remitente/tratamento farmacológico , Natalizumab/uso terapêutico , Adulto , Quimioterapia Combinada , Feminino , Seguimentos , Humanos , Fatores Imunológicos/uso terapêutico , Imunossupressores/uso terapêutico , Masculino , Esclerose Múltipla Recidivante-Remitente/epidemiologia , Esclerose Múltipla Recidivante-Remitente/patologia , Polônia/epidemiologia , Prognóstico , Estudos Prospectivos , Taxa de Sobrevida
11.
PLoS One ; 7(7): e40092, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22808097

RESUMO

RNA biomarkers discovered by RT-PCR-based gene expression profiling of archival formalin-fixed paraffin-embedded (FFPE) tissue form the basis for widely used clinical diagnostic tests; however, RT-PCR is practically constrained in the number of transcripts that can be interrogated. We have developed and optimized RNA-Seq library chemistry as well as bioinformatics and biostatistical methods for whole transcriptome profiling from FFPE tissue. The chemistry accommodates low RNA inputs and sample multiplexing. These methods both enable rediscovery of RNA biomarkers for disease recurrence risk that were previously identified by RT-PCR analysis of a cohort of 136 patients, and also identify a high percentage of recurrence risk markers that were previously discovered using DNA microarrays in a separate cohort of patients, evidence that this RNA-Seq technology has sufficient precision and sensitivity for biomarker discovery. More than two thousand RNAs are strongly associated with breast cancer recurrence risk in the 136 patient cohort (FDR <10%). Many of these are intronic RNAs for which corresponding exons are not also associated with disease recurrence. A number of the RNAs associated with recurrence risk belong to novel RNA networks. It will be important to test the validity of these novel associations in whole transcriptome RNA-Seq screens of other breast cancer cohorts.


Assuntos
Neoplasias da Mama/genética , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Recidiva Local de Neoplasia/genética , Inclusão em Parafina , Análise de Sequência de RNA , Fixação de Tecidos , Sequência de Bases , Biomarcadores Tumorais/genética , DNA Intergênico/genética , Feminino , Formaldeído , Humanos , Íntrons/genética , Modelos de Riscos Proporcionais , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Neoplásico , Fatores de Risco
12.
BMC Bioinformatics ; 7: 211, 2006 Apr 19.
Artigo em Inglês | MEDLINE | ID: mdl-16623940

RESUMO

BACKGROUND: Gene expression microarray experiments are expensive to conduct and guidelines for acceptable quality control at intermediate steps before and after the samples are hybridised to chips are vague. We conducted an experiment hybridising RNA from human brain to 117 U133A Affymetrix GeneChips and used these data to explore the relationship between 4 pre-chip variables and 22 post-chip outcomes and quality control measures. RESULTS: We found that the pre-chip variables were significantly correlated with each other but that this correlation was strongest between measures of RNA quality and cRNA yield. Post-mortem interval was negatively correlated with these variables. Four principal components, reflecting array outliers, array adjustment, hybridisation noise and RNA integrity, explain about 75% of the total post-chip measure variability. Two significant canonical correlations existed between the pre-chip and post-chip variables, derived from MAS 5.0, dChip and the Bioconductor packages affy and affyPLM. The strongest (CANCOR 0.838, p < 0.0001) correlated RNA integrity and yield with post chip quality control (QC) measures indexing 3'/5' RNA ratios, bias or scaling of the chip and scaling of the variability of the signal across the chip. Post-mortem interval was relatively unimportant. We also found that the RNA integrity number (RIN) could be moderately well predicted by post-chip measures B_ACTIN35, GAPDH35 and SF. CONCLUSION: We have found that the post-chip variables having the strongest association with quantities measurable before hybridisation are those reflecting RNA integrity. Other aspects of quality, such as noise measures (reflecting the execution of the assay) or measures reflecting data quality (outlier status and array adjustment variables) are not well predicted by the variables we were able to determine ahead of time. There could be other variables measurable pre-hybridisation which may be better associated with expression data quality measures. Uncovering such connections could create savings on costly microarray experiments by eliminating poor samples before hybridisation.


Assuntos
Artefatos , Perfilação da Expressão Gênica/instrumentação , Modelos Genéticos , Análise de Sequência com Séries de Oligonucleotídeos/instrumentação , Garantia da Qualidade dos Cuidados de Saúde/métodos , Simulação por Computador , Interpretação Estatística de Dados , Desenho de Equipamento , Análise de Falha de Equipamento , Modelos Estatísticos , Controle de Qualidade , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
13.
Hum Mol Genet ; 15(6): 965-77, 2006 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-16467349

RESUMO

Huntington's disease (HD) pathology is well understood at a histological level but a comprehensive molecular analysis of the effect of the disease in the human brain has not previously been available. To elucidate the molecular phenotype of HD on a genome-wide scale, we compared mRNA profiles from 44 human HD brains with those from 36 unaffected controls using microarray analysis. Four brain regions were analyzed: caudate nucleus, cerebellum, prefrontal association cortex [Brodmann's area 9 (BA9)] and motor cortex [Brodmann's area 4 (BA4)]. The greatest number and magnitude of differentially expressed mRNAs were detected in the caudate nucleus, followed by motor cortex, then cerebellum. Thus, the molecular phenotype of HD generally parallels established neuropathology. Surprisingly, no mRNA changes were detected in prefrontal association cortex, thereby revealing subtleties of pathology not previously disclosed by histological methods. To establish that the observed changes were not simply the result of cell loss, we examined mRNA levels in laser-capture microdissected neurons from Grade 1 HD caudate compared to control. These analyses confirmed changes in expression seen in tissue homogenates; we thus conclude that mRNA changes are not attributable to cell loss alone. These data from bona fide HD brains comprise an important reference for hypotheses related to HD and other neurodegenerative diseases.


Assuntos
Encéfalo/metabolismo , Perfilação da Expressão Gênica , Doença de Huntington/genética , Doença de Huntington/metabolismo , Adulto , Idoso , Axônios/metabolismo , Encéfalo/patologia , Morte Celular/genética , Feminino , Humanos , Doença de Huntington/patologia , Masculino , Pessoa de Meia-Idade , Análise de Sequência com Séries de Oligonucleotídeos , RNA Mensageiro/biossíntese , Transdução de Sinais/genética
14.
Biostatistics ; 4(2): 249-64, 2003 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-12925520

RESUMO

In this paper we report exploratory analyses of high-density oligonucleotide array data from the Affymetrix GeneChip system with the objective of improving upon currently used measures of gene expression. Our analyses make use of three data sets: a small experimental study consisting of five MGU74A mouse GeneChip arrays, part of the data from an extensive spike-in study conducted by Gene Logic and Wyeth's Genetics Institute involving 95 HG-U95A human GeneChip arrays; and part of a dilution study conducted by Gene Logic involving 75 HG-U95A GeneChip arrays. We display some familiar features of the perfect match and mismatch probe (PM and MM) values of these data, and examine the variance-mean relationship with probe-level data from probes believed to be defective, and so delivering noise only. We explain why we need to normalize the arrays to one another using probe level intensities. We then examine the behavior of the PM and MM using spike-in data and assess three commonly used summary measures: Affymetrix's (i) average difference (AvDiff) and (ii) MAS 5.0 signal, and (iii) the Li and Wong multiplicative model-based expression index (MBEI). The exploratory data analyses of the probe level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. We evaluate the four expression summary measures using the dilution study data, assessing their behavior in terms of bias, variance and (for MBEI and RMA) model fit. Finally, we evaluate the algorithms in terms of their ability to detect known levels of differential expression using the spike-in data. We conclude that there is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model which removes probe-specific affinities.


Assuntos
Sondas de DNA/genética , Interpretação Estatística de Dados , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Algoritmos , Animais , Perfilação da Expressão Gênica/estatística & dados numéricos , Humanos , Modelos Lineares , Camundongos , Distribuição Normal , Reprodutibilidade dos Testes , Estatísticas não Paramétricas
15.
Nucleic Acids Res ; 31(4): e15, 2003 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-12582260

RESUMO

High density oligonucleotide array technology is widely used in many areas of biomedical research for quantitative and highly parallel measurements of gene expression. Affymetrix GeneChip arrays are the most popular. In this technology each gene is typically represented by a set of 11-20 pairs of probes. In order to obtain expression measures it is necessary to summarize the probe level data. Using two extensive spike-in studies and a dilution study, we developed a set of tools for assessing the effectiveness of expression measures. We found that the performance of the current version of the default expression measure provided by Affymetrix Microarray Suite can be significantly improved by the use of probe level summaries derived from empirically motivated statistical models. In particular, improvements in the ability to detect differentially expressed genes are demonstrated.


Assuntos
Sondas de DNA/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Sistema Nervoso Central/metabolismo , Sondas de DNA/normas , Perfilação da Expressão Gênica , Humanos , Fígado/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos/normas , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reprodutibilidade dos Testes , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...